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Abstract 

In this paper we study time-consistent risk measures for returns that are given by a GARCH(1,1) 
model. We present a construction of risk measures based on their static counterparts that over¬ 
comes the lack of time-consistency. We then study in detail our construction for the risk measures 
Value-at-Risk (VaR) and Average Value-at-Risk (AVaR). While in the VaR case we can derive an 
analytical formula for its time-consistent counterpart, in the AVaR case we derive lower and up¬ 
per bounds to its time-consistent version. Furthermore, we incorporate techniques from Extreme 
Value Theory (EVT) to allow for a more tail-geared statistical analysis of the corresponding risk 
measures. We conclude with an application of our results to a data set of stock prices. 
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1 Introduction 


In the wake of the financial crisis risk management constitutes a constant active field that attracts 
both mathematical research and quantitative requirements for the practical implementation. Most 
financial institutions need to abide with the Basel II/III accords that prescribe certain risk manage¬ 
ment rules to be applied to internal risk control and that are under periodic regulatory supervision. 
Over the last two decades the key notion of risk management arose in the form of a risk measure 
referred to as Value-at-Risk (VaR). Simply put, VaR determines the risk capital of a financial in¬ 
stitution as the quantile of a profit-and-loss distribution with respect to some prescribed (either by 
regulation or by internal rules) time horizon and confidence level. An axiomatic approach to the 


field of risk measures is given by Artzner et al. (1999) in which the notion of the coherent risk mea¬ 


sure is introduced and where it has been realized that VaR does not always satisfy the property of 
coherence. Artzner et al. (1999) introduce a risk measure that amends the lack of coherence that 


is nowadays known as the Average-Value-at-Risk (AVaR). An extension to convex risk measures is 
given in Follmer and Schied (2002), which integrates existing notions of risk into the mathematical 


framework of convex dual theory and, hence, allows for deep and powerful dual characterizations. 
In order to account for the dynamic stochastic evolution of profit-and-loss positions the static risk 
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measurement has been extended to the class of dynamic risk measures, which treats the risk measure 
not only as a (nonlinear) expectation but as a stochastic process, see e.g. Detlefsen and Scandolo 


(2005) and Riedel (|2004 ) for the extension to the dynamic setting by means of convex dual theory. It 


has been realized in this dynamic framework that most existing static risk measures do not transfer in 
a straightforward manner into processes without violating the required property of time-consistency. 
A time-consistent dynamic risk measure secures the consistent behavior of a risk measure that, if a 
portfolio is riskier than another portfolio at some future time, then this portfolio has been riskier that 
the other portfolio at any time before. The literature on time-consistency of risk measures is diverse 
and rich as different mathematical viewpoints can be adopted to prevent the consistency property. 
An incomplete chronicle of research done in the field of time-consistent risk measures includes |Peng| 

Follmer and Pennerj (|2006[), 


(2004 

), 

o 

o 



Detlefsen and Scandolo 


(2005), Weber 


Roorda and Schumacher (2007), Penner (2007), Bion-Nadal (2009), and Bielecki et al. ( 2015| ). A 


(2006) 


major result from the research on time-consistency reveals that in the class of law-invariant risk 
measures there is only one risk measure that, upon transfer into a time-dynamic process setting, 
supports time-consistency, namely the entropic risk measure (cf. Follmer and Knispel ( 2011[ )). 

In parallel to the aforementioned theoretical work statistical models and methods have been developed 
to calibrate and integrate risk measures to real world data. As the industry standard VaR and its 
coherent counterpart AVaR are law-invariant risk measures, the main goal for an implementation 
of (A)VaR is to find a good estimate of the profit-and-loss distribution in the relevant region. In 
this field, the major class of estimation methods comprise the historical simulation method, methods 
based on Gaussian distribution assumptions and methods based on Extreme Value Theory (EVT). 


We refer to McNeil et al. (2005), in particular Chapter 2 and Chapter 7, for a detailed account and 


references to methods of profit-and-loss distribution estimation. More background on extreme value 


theory can be found in the monograph Embrechts et al. (1997). McNeil and Frey (2000) propose 


an implementation of VaR and AVaR that is based on an estimation of the log-returns distribntion 
using a combination of a GARCH(1,1) model fit and an EVT approach for the residuals. Their 
method proceeds in a two-step scheme: first, the GARCH(1,1) model mimics the inherent stochastic 
volatility of financial time series, and the GARCH parameters are estimated by a pseudo maximum 
likelihod method. Second, they adopt a Peaks-over-Threshold (POT) approach to the residnals 
and only consider those residuals that exceed a critical value. The POT method justifies fitting a 


Generalized Pareto distribution (GPD) by means of a maximum likelihood method (e.g. Embrechts 


et al. (1997), Section 3.4 and Section 6.5) It is also in accord with the typically high confidence levels 
that are imposed on (A)VaR to zoom into the extreme branch of losses. Applying the POT method 
to the residnals rather than directly to the log-returns has the advantage that the fitting procedure 
to the extremes only needs to be applied once due to the white noise property of the residuals. Using 


these two steps, McNeil and Frey (2000) succeed to estimate (A)VaR by fitting a distribution that 


adequately accounts for the extremes in the tail and under mild conditions allows for closed form 
formulas for VaR and AVaR. 

The goal of onr paper is to incorporate dynamic time-consistency for VaR and AVaR. We investigate 
the extension of static risk measures to dynamic counterparts that satisfy time-consistency. A key 
property to succeed in this transfer is the dynamic programming principle, see [Cheridito and Stadje 


(2009), Cheridito and Kupper (2011). 


The two-step estimation scheme from McNeil and Frey 


using GARCH(1,1) and EVT allows 
us to derive a closed form expression for the dynamic time-consistent VaR that is easily implemented 
using the estimated GPD and the GARCH parameters. For AVaR however, such a closed form 
expression cannot be obtained and we derive closed form lower and npper approximations to AVaR. 
On top of being more conservative than their static counterparts, the dynamic time-consistent VaR 
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offers the benefit that the risk measurement of aggregated losses, which in e.g. [McNeil and Frey 


(2000) have to be estimated by simulation methods, can now be estimated in a (semi-)closed way by 
simply aggregating the VaRs of the single positions at different future time points. 


The paper is structured as follows. In Section we present preliminaries on dynamic risk mea¬ 
sures along with the dynamic programming principle characterization. Moreover, we introduce the 
GARCH(1,1) loss model, which establishes the model framework for the entire paper. In Section]^ 
we apply the new methodology from the previous section to derive a closed form expression for the 
time-consistent VaR and investigate its properties concerning the evolution over time and prove the 
linearization of aggregated losses. Section is devoted to the study of AVaR. Since a closed form 
expression for time-consistent AVaR is not possible, as an alternative, we derive closed form expres¬ 
sions for pragmatic bounds to AVaR and study the properties as in the previous section. The proofs 
of the results of Sections 3 and 4 are postponed to the Appendix. In the last Section we give a 
rehash on the part of extreme value theory that is relevant for our purpose, and apply our results to 
a data set of stock prices. 


2 Conditional risk measures 


Given a probability space {Q,!F,F) we consider a filtration where T e N. We denote by 

LP{J-t) with t e {0,... ,T} the set of all .T^t-measurable random variables A : 11 ->• M. In this paper, 
the space represents the space of all financial positions for which we need a risk assessment. 

Typically, we will be interested in losses, i.e. the negatives of log-returns of financial data. 

Since conditional risk measures are random variables, all properties, equalities and inequalities below 
hold almost surely with respect to P, and we assume this throughout without making extra mention 
of it. 


Definition 2.1. For t e {0,...,T} a family of mappings {4>t)tLo with ft '■ L^{Tt) is a 

dynamic monetary risk measure if it satisfies the following properties: 


(i) Normalization: ftiO) = 0 for t = 0,... ,T; 

(a) Monotonicity: </>t(A) > 0t(V) for all A, V e L^{!Ft) such that A > V, for t = 0,... ,T; 


(Hi) Translation invariance: <?i>t(A + m) = 4>t{X) + m for all X e L^{!Ft) o,nd m e LP{!Ft), for 

t = o,...,r. 

If represents the space of all profit and loss variables, the above definition leads to the notion 

of a dynamic monetary utility function, see Definition 2.1 in Gheridito and Kupper (2011). If a 
dynamic monetary risk measure f satisfies in addition to Definition [2.1 j (i)-(iii) 

• Positive homogeneity: ft{^X) = \ft{X) for all A e L^{Ft) and A > 0, for t = 0,..., T; 

• Subadditivity: ftiX + V) < ft{X) + 4>t{Y) for all A, V e for t = 0,..., T, 


then we say that 0 is a coherent (dynamic monetary) risk measure. 

Definition 2.2. A dynamic monetary risk measure f := is time-consistent if 


ft+iiX) > ft+iiY) implies ft{X)>cl)t{Y), 


for all X,Y€L^{Ft), for t = 0,... ,T - 1. 


The following useful characterization of time-consistency can be found in Gheridito and Kupper 

dMTl). 
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Proposition 2.3. A dynamic monetary risk measure ((/>t)^Q is time-consistent if and only if it 
satisfies the Bellman principle 


M^) = t+i (X)) 


( 2 . 1 ) 


for all X 6 LP{!Ft), and t = 0,... ,T - 1. 


It has been noted in Cheridito and Stadje (2009) and Cheridito and Kupper (2011) that there is 
another way to construct time-consistent dynamic risk measures: let {pt)lSQ be an arbitrary dynamic 
monetary risk measure 


pt-.L\TT)-L\Tt), t = 0,. 


,r-i, 


then the backward iteration 


MX)--=X, MX)--=Pt{cl^ui{X)), t = 0,...,T-l, (2.2) 

defines a process ((/>t)^g which by definition is a time-consistent dynamic risk measure. The following 
property is a straightforward consequence of the construction of {4>t)f=o- 

Corollary 2.4. ForX€L^{iFT) we have for t = - 1 

(t)t{X) = {^pt o pui o ■■■ o pT-i'^{X). (2.3) 

For a coherent risk measure cj), its subadditivity property implies that for any fixed t e {0,..., T} and 
m e N such that t + m<T and any Xt^f. e for k= 1,..., m we have 

m m 

cl>t{Z^t^k)<ZMXuk)- (2.4) 

k=l k=l 

We construct our time-consistent dynamic risk measures by backwards iteration. 

2.1 The GARCH(1,1) model for loss positions 

Recall that we are interested in the risk assessment of losses. The focus of this paper is on a 
particular class of loss processes its dynamics is governed by a GARCH(1,1) process and 

typically represent (negative) log-returns. It holds that (Tt)^i satisfies 


Tt — CfZf, 

a'f = ao + aiL^_i + ba^_i, 


(2.5) 


where ao,ai,6 > 0 are the model parameters, uo and Lq are .To-measurable initial random variables, 
and is a strict white noise process (independently identically distributed with zero mean and 

unit variance). Note also that by (2.5) at is measurable with respect to Xt-i for every t = 1,... ,T. 

We denote by : M ^ [0,1] and : [0,1] - 
quantile function of each Zt, respectively; i.e.. 


the distribution function and the left-continuous 


Fz{z) =^{Zt < z), F^^(q;) = inf{x e M : F^(x) > a}, a €(0,1), t = 0,, 


( 2 . 6 ) 


For properties of the quantile function F^^ we refer to Resnick (1987), Section 0.2, or Embrechts 


et al. (1997), Proposition A1.6. 
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We assume that Fz is strictly increasing, thus is continuous, and that the right endpoint of Zt 
is inhnite; i.e., 

xp = inf{x e M : Fz{x) = 1} = oo. 

If necessary we identify F^^{1) with xp = oo. Since Z has infinite right endpoint, and a is close 
to 1, F'^^{a) is as a rule positive. We shall also need the quantile function of Z"^ and note that for 
Z symmetric, F^\{a) = F^^{^{a + 1))^. Note further that a < |(a; + 1) for a e (0,1), hence (for 
F-^\a) > 1 ) 

F-z\a) < F-z\af < F^^^ia + l))^ = F^i(a). (2.7) 

We summarize the assumptions which we will assume throughout the paper. 

Assumptions A: We assume that Fz is strictly increasing with support M and that F^^{a) > 0. 
For simplicity, we also assume that Z is symmetric. 

Since we often work with distribution tails, we note that F'^^ can also be represented as 

F 2 ^(a) = inf{x e M : P(Zi > x) < 1 - a}, ae(0,1), t = 0,...,T. (2.8) 


3 Conditional time-consistent Value-at-Risk 


In this section we study Value-at-Risk (VaR) in the framework of dynamic time-consistent risk 
measures. One typically considers L e LP{Ft) which represents a possibly large loss position, for 
which the probability of L exceeding a loss threshold m > 0 should be bounded by a small probability 
1 - a, i.e. a is typically close to 1. The smallest loss threshold m which satisfies this bound is the 
VaR“. Several versions of the (conditional) VaR definition can be found in the literature. In analogy 


to (2.8) we work throughout with the following, which caters best to the purpose of the treatments 


in this paper. 

Definition 3.1. Given a loss position L e LP{Ft) the Value-at-Risk at level a e (0,1) at time 
t e {0,..., T} for L is defined by 


VaR^(L) := essinf{m e L^{Ft) ■ F{L<m\ Ft) > a}, 


(3.1) 


3.1 Time-consistent VaR for single day losses 


We start this section with the following example, which is the core object of interest in McNeil and 


Frey (2000). 


Example 3.2. For t = 0,... ,r- 1 let Lt+i be given by (2.5). Then YaRf{Lt+i) is the 1-day-ahead- 
VaR, which can be computed straightforwardly as 

VaR“(Lt+i) = essinf{m e LP{Ft) ■ F{at+iZt+i <m \ Ft)> a}. 

Since at+i is .7)-measurable, we also have that m ■■= mjat+i is .Ft-measurable. Using the independence 


between Zt+i and Ft, and also (2.8), we can continue 


VaR"(Lf+i) = (Ti+iessinf{m e L^iFt) ■ F{Zt+i <m\ Ft)>a} 
= at+i inf {m e M+ ; F{Zt+i < rh) > a} 
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= at+iFz^{a). 


( 3 . 2 ) 


Moreover, since Lt^ Ft, 

VaR“(L4 + Lui) = Lt + VaR“(Ri+i) = U + auiFz\a). 


(3.3) 

□ 


There are examples showing that Value-at-Risk from Definition 3.1 is not time-consistent (e.g. 


Cheridito and Stadje (2009) or Follmer and Schied (2011, Example 11.13)). As the GARCH(1,1) 


model (2.5) is defined by an iteration, one could hope that for this specific model VaR“ is time- 
consistent. However, this is not true and we provide a counterexample, which makes use of Proposi¬ 
tion! 


Example 3.3. In order to see why in the framework of GARCH(1,1) losses VaR cannot be time- 
consistent, recall that according to Proposition 2.3 VaR“ is time-consistent if and only if it satisfies 
the dynamic programming principle 

VaR“ = VaR" o VaR" i, t = 0,..., T - 1. 

For t e {0,..., T - 2}, by (3.2), we have YaK^^i{Lt+ 2 ) = o't+ 2 F^^{a) and, hence, 

VaR"(VaR" i(Lt+2)) = VaR"(at+2i^z^(«))- 

We compute VaRf(Lt+ 2 ) and YaRf{(Tt+ 2 F^^{a)) for the GARGH(1,1) model: 

VaR"(cJt+ 2 Pz^(a)) = m* = essinf{m e L^{Ft) '■ P((Ti+ 2 Pz^(a) < m \ Ft) > a} 

= essinfjm e L^{Ft) ■ 1P(\/ oo + o't+ii^^^t+i ^ I ^t) - “}• 

Since the function yj (ao + + b))F^^{a)‘^ is strictly increasing in and .Ti-measurable, 

we obtain 

m* = yj (ao + o-^^^(aiF-i(a) + b))Fz^{a). (3.4) 

Next we compute 

VaR"(Lt+ 2 ) = m** = essinfjm e LP{Ft) ■ f‘[at+ 2 Zt +2 < rn \ Ft) > a} 

= essinf{m e L^{Ft) ■ uq + cr‘f^i{aiZ'f^^ + b)Zt +2 < m \ Ft) > a). 

Now assume that m** = m* for all a e (0,1). We denote by Pj the conditional probability with 
respect to Ft and calculate 


a = ^*(’^00 + -H b) < yjao + al^^{aiF^l{a) +b)) 

= Pt(\/ao + (^t+ii^^^t+i + b)Zt+2 < \Jao + a‘^^^{aiF-l{a) + h)Fz^{a)) 

ao + + b)z < yJao + a^^^{aiF^l{a) + b)Fz^{a))dFz{z) - 1 

= 2^ ^ Pt(\/ao + + b)z<yJao + a'^^-^{aiF-l{a) + b)Fz^{a))dFz{z) 

f Ft{yJao + cT2^i(aiZ2^^ + b)z<Jao + al^-^{aiF^l{a) + b)Fz^{a))dFz{z) - 


+ 2 
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Now note for the first integral that F^^{a)lz decreases in 2 ; to 0 and has minimum 1 over the integral 
range. This implies for the probability under the integral, that the left-hand random variable scaled 
by 2 ; decreases with 2 ; to a. Moreover, since the support of Zt+i has infinite right endpoint, the second 
integral is positive. Hence, we estimate the right-hand side by 

> 2 , 0 ? + 2(1 — 1 , 

where a > 0. However, for a close to 1 we have a + 1 < 2a^ + 2a. □ 


Cheridito and Stadje (2009) propose to amend the time-inconsistency of VaR using the backward 


iteration (2.2). This gives rise to the following definition. 

Definition 3.4. Given a loss position L e LP{Ft) o,nd VaR“ from Definition 
consistent Value-at-Risk at level a e (0,1) for L is defined by 


3.1 


Then the time- 


VaRr(T) := VaRr(L) = L, VaR“(L) := VaR"(VaR“ i(L)), t = 0,...,T-l. 


(3.5) 


In the notation of the construction from the recursion (2.2), this corresponds to pt ■= VaRf and 
(pt ■•= VaRj . As a consequence of the construction of we find for L e LP{1Ft) 


VaR, 


\L) = ( 


VaR“ o VaR: 


■t+i 


■■oVaRy_i 


Yl). 


(3.6) 


The choice of the GARCH(1,1) model (2.5) entails the convenient feature that the m-day ahead VaR 
assessment allows for a closed form solution. More precisely, we can derive an analytical solution for 
the time t risk assessment of the GARCH (1,1) loss at terminal time T as follows (as usual we set 
= 6). The proof is given in Appendix |A[ 

Theorem 3.5. Let he the loss process given by the GARCH(1,1) model (2.5). Then we have 

^“(Lt) = Fz\a)^V]^{aiF-l{a)+b), t = 0,...,T-l, (3.7) 

->• M is an .Tt-measurable mapping given by 


where VT 


rlix) := ao^~Z 

k^O 


t = 0, . 


,T-1. 


(3.8) 


3.2 Time-consistent VaR for aggregated losses 

We now come to the computation of the m-day-ahead VaR“. So far, we have considered risk positions 
at a fixed day T that is ahead of time t <T up to which information in the form of the filtration Ft is 
available. The m-day-ahead VaR" is a risk assessment of aggregated losses Lt^i~ from the time period 
[t + l,t + m] for t + m < T. Next we show that VaR linearizes across the aggregation of GARCH(1,1) 
losses. 

Proposition 3.6. Let {Lt)f^Q be the loss process given by the GARCH(1,1) model 
have for fixed te {1,...,T-1} and m e N such that t + m<T, 

^ Luk) = Fz\a) ^ ^ (3.9) 

fc=l k=l fc=l 


2.5), then we 
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Proof. First note that for m = 2 we know from (|3.3l) that 


VaR“(i4+i + Lu 2 ) = VaR“(Lt+i + VaR“ 

= VaRf (cTt+iZ^+i + at+ 2 Fz^{a)), 


where the second line follows from (3.2). By (2.5) we have (Tj ^2 = which transforms 

the last equation into 

+ Lu2) = VaR“(at+iZf+i + Fz\a)^ + h)). 


By the definition of VaRf and the fact that at+iZt+i + F{oi)\J + h) is a strictly 
increasing .Ft-measurable function of we find that 


VaR, 


'(Lt+i + Lu 2) = at+iFz^{a) + «o + (^ui^aiF^l{aY + b), 


(3.10) 


which is equal to the sum VaR^ {Ft+i) + VaR^ (Ri+ 2 ) and also equal to the corresponding sum in 
the center. We proceed by induction and assume that (|3.9[) is true for 'ETFi ^t+k- Since the sum is 


.F)+m-measurable, we obtain by (3.6) 


, m -1 


' k=l 


VaR“( g Luk) = (VaR“ o VaR“ 1 o o VaR“ ^_i)( ^ L^k + Lt+m) 

' ’ fc=i 

m -1 

,)( ^ Li+fc + VaR 


= (VaR“oVaR“i0...oVaR“,^_2j 


t+m- 


k=l 




VaR“(Lf+i + VaR“ i(L 4+2 + - + VaR“ ^_ 2 ( W-i) + VaR“ W))) 

VaR“(R+i + i( E Luk)) = VaR“(R+i + f VaR“ 


'k=2 k=2 

where the last identity follows by the induction hypothesis, which also implies 


fc =2 


lit _ lit , - 

^ VaR“ = F-z\a) ^ ^VlX\{a^F-l{a) + h) 

' ^ k=2 

m 

= Fz\a)Y, 


k=2 


N 


k-3 


ao E {aiF^l{a) + bf + cr2^2(®i-^z2(“) 
1=0 


We use cr ^^2 = ®o + (^t+ii^^^ui from (2.5) and observe that 


fc =2 \ 


k-3 


ao E {aiF^Ka) + bY + (ao + + b)^{aiF^l{a) + b) 

1=0 


k-2 


k=2 \ 


k~2 

ao E {aiF-j{a) + b)^ + (0-2^1(01^2^^ + 6))(aiF-J(a) + bf ^ 
1=0 


is a strictly increasing function in Z^i- Hence we can proceed by the same argument as in the pretext 


leading to (3.10) to achieve ultimately 


VaR“(Li+i + E VaR“ i(Li+fc)) 
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m 


at,iF-z\a) + F-z\a)Y. 


k=2 

m 


i 


k-2 


ao Y. {aiF^l{a) + b)^ + a‘^^^{aiF^l{a) + b) 


k-l 


j=0 


lit I- 

auiF^ia) + F-^ia) Y VPf"(«i^zK«) + b) 

k^2 

m I - 

F-Y{a) Y ^Vt^{a^F-l{a)+b) 


k=l 

'^OL , 


= ^VaRi(Li,fc). 

k=l 

This finishes the proof. □ 

4 Conditional time-consistent Average Value-at-Risk 

This section is devoted to the study of time-consistent alternatives for the Average Value-at-Risk 
(AVaR). Due to coherence AVaR is commonly considered as a more reasonable rectification of VaR. 
For more details we refer to Follmer and Schied (2011, Chapter 4). For t e {0,..., T} we define L^{Ft) 
as the set of all P^-integrable losses, where Pj denotes the conditional probability with respect to Ft 
and Ej the corresponding conditional expectation. The following definition relates AVaR to VaR. 

Definition 4.1. Given a loss position L e L^(Ft) the Average Value-at-Risk at level a e (0,1) at 
time te{0,...,r} is given by 


AVaRi“(L) = f\aRUL)du. 

1- a Ja 


(4.1) 


with VaRf (L) as in Definition 3.1 


Whereas VaR quantifies the risk associated to one particular level of risk, reflected in the choice of 
a, AVaR as an integrated VaR takes into account VaR at the entire bandwidth of risk levels between 
a and 1 and thus better reflects volume of extreme risks that VaR might neglect. 


The following is the analog of a fact well-known for unconditional AVaR (e.g. Lemma 2.16 of McNeil 


etlRl i pOO^ ). 

Remark 4.2. If the loss position L e L^{Ft) has a continuous distribution function, then 

AVaR)*(L) = E 4 [L I L>VaR^(L)], t = 0,...,r. (4.2) 


Due to (4.2), AVaR is often also referred to as conditional VaR or Expected Shortfall. 


Assumption B: Additionally to Assumptions A we require from now on also that Z has a 

continuous distribution function. 

4.1 Time-consistent AVaR for single day losses 


We focus again on the GARCH(1,1) model from (2.5). 


Example 4.3. Assume the setting as in Example 3.2 For t = 0,... ,T - 1 let Lt+i be given by 


(2.5). Then AVaRf(Lt+i) is the 1-day-ahead-AVaR. If the innovations {Zt)t>o have a continuous 
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distribution function Fz, then by linearity of the conditional expectation and J^t-measurability of 
(Tt+i we get for t = 0,..., T - 1, 


AVaR“(Li+i) = [Z^i \ > Fz\a)] 

1 r°° 

= crui- - ydFziy) 

1- a JF-^(a) 


at+iA\aR^{Z). 


This calculation can also be found in McNeil and Frey ( 2000[ ). 

In analogy to Section]^ a time-consistent version of AVaR is constructed as follows. 


Definition 4.4. Given a loss position L e L^{Ft) and the AVaRf(L) as in Definition 4-F Then 
the time-consistent Average Value-at-Risk at level a e (0,1) for L is defined by 


AVaR^(L):=L, AVaR"(L) := AVaR"(AVaR“+i(L)), t = 0,...,r-l. 


(4.3) 


For the Average Value-at-Risk of the squared loss at time T we can derive an explicit formula 


similar to (3.7). Note that, though AVaR of allows for an interpretation as the conditonal volatility 
at time T, our purpose of investigation is to employ AVaR of to derive pragmatic bounds to AVaR 


itself, see Section 4.2 below. 


We start with a result analog to Theorem 3.5, and recall that Y,k=o^k = 0- The proof is given in 
Appendix 


Theorem 4.5. Let (Tt)^o given by the GARCH(1,1) model (2.5). Then we have for the squared 
loss L‘^ e L^{Ft) at terminal time T 


AVaR? 


= ^ f'F^l{u)duVUa,:^ f'Fzl{u)du + b), t = 0,...,T-l, (4.4) 

1 - a Ja ^ 1 - a Ja f 


where VT '■ 


is an .T't-measurable mapping given by 
T-t-2 


vj(x) = oo XI ® + W+1^ 
fc =0 


T-t-1 


t = 0,...,T-l. 


It is also possible to derive expressions for m-day ahead AVaR . As usual we define nj=i aj = 1. 


Proposition 4.6. Let {Ltfl^Q be given by the GARCH(1,1) model ( |2.5[ ). For f > 0 define = aui 
and for fixed m>2 


m-2 k m-1 

Qt""izi,Z2,...,Zm-i) ■■= ao X YliaiZj + b) + H (aiZk + b). 

k=0 j=l j=l 


(4.5) 


Then AVaR^ (Tt+i) = AVaR“(Z) at+i and for fixed m> 2 
AVaR“(Z) 


AVaRj (T) 




T r" V ^ \lQf^{zi ,..., Zm-l)dFz.{z^)-dFz 2 {Zm-l). (4.6) 
JF }{a) JF f{a) 


(m-l)-times 
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Proof. For m = 1 note that by (4.1) AVaRj (Lj+i) = AVaRf(Lf+i) = as in 

Example 4.3, For simplicity we set k = /i^i(o) yd(y). For AVaR^(Lt+ 2 ) we use this and then 


Lemma F.l and compute for m = 2 


A^j {Lt+2) = AVaR“(A^i+i(Lf+2)) = AVaR“(AVaR“ i(Lt+2)) 
= AVaRf (Tiat+s) 

= KAVaRt“((Ji+2) 

= KEt[at+2 I crt+2 > VaR“+i((Tt+2)] 

= KEt[cJi+2 I ^4+1 > F^l(a)] 

= YTff ^t[\/ao + + b)l^zl^^F^l(a)}] 

~K r oo j - 

= 7— Li. Jao + (Tl^^{aiz + h)dFz2{z), 

i CF ^ Z ' ^ / 

since at+i is .Ft-measurable. Assume that ( |4.6[ ) holds for Lt+ 2 j ■ ■ ■ ,Lt+m-i- Then 
A^“(W) = AVaR“(A^“ i(( W))) 


(4.7) 

(4.8) 

(4.9) 


= AVaR"( 


(1-a) 




T«) 

(m-2)-times 


where 


^(t+l)+(m-l) 


m-3 k m-2 

{zi,Z 2 ,...,Zm- 2 ) ■= ao Y, + ^) +^t +2 +^)- 

fc=0 j=l j=l 


Setting cjj ^2 = oq + + b), then since is .F)-measurable, factorization of gives 

another integral and another factor 1 - a in the denominator. □ 

4.2 Almost sure bounds for AVaR 


Finding an analytical expression for AVaRj (Lt) for the (unsquared) GARCH(1,1) loss is not straight¬ 
forward. It is however possible to derive closed form bounds for AVaR^ {Lt)- 

4.2.1 AVaR-bounds for single day losses 


We now derive a closed form upper bound to AVaR which arises from an application of Jensen’s 
inequality. For the proof of the following Proposition we refer to Appendix [B) 


Proposition 4.7. Let Lt he the loss position at time T > 0 given by the GARCH(1,1) model (2.5). 
Then 


AVaR, 


{^T)-=Y^f^Lz^{y)^yyL'T{aiY^f^L'^l{u)du + b'j, t = 0,...,T-l, (4.10) 


where Vf{-) is given by (3.8), satisfies 


AVaR( < AVaR^ t = 0,..., T - 1. 

An easy alteration of the proof of the previous result yields a closed form lower bound AVaR " for 
A^". 
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Proposition 4.8. Let Lt be the loss position at time T > 0 given by the GARCH(1,1) model (2.5). 
Then 


+ at+i, t = 0, ...T-l, (4.11) 


AVaR! 


satisfies 


AVaRj {Lt) > AVaR ° (Lt) f = 0,..., T - 1. 


4.2.2 AVaR-bounds for aggregated losses 


Unfortunately, for AVaR there exists no result corresponding to Proposition 3.6, hence AVaR does 


not linearize across aggregation of GARCH losses. A key obstacle is that Lemma B.l does not apply. 


However, due to the subadditivity of AVaR and the property (2.4), we can derive an upper bound 
for the aggregation of GARGH losses. 


Proposition 4.9. Let {Lt)f^Q be given by the GARGH(1,1) model ( |2.5[ ). Then, for t e {1,... ,r-1} 
and m € N such that t + m<T, the m-day-ahead AVaR^ of aggregated losses ^t+k is bounded 
by 


AVaR, ( ^ Lt^m+i) < E AVaR“(L,+™.,i). 

k=l k=l 

Proof. By the subadditivity property (2.4) AVaR^( T,T=i ^t+m+i) satisfies 


(4.12) 


AVaR, (E Lt+m+l )<EAVaR, (L 

i+m+l) • 

k=l k=l 


Now the assertion follows from an application of Proposition 4.7 


□ 


Remark 4.10. Due to the lack of linearization across aggregation of GARGH losses, the aggre¬ 
gation of the m-ahead AVaR bounds from Proposition |4.8| do not produce a proper lower bound 
for AVaR, ( -bi+m+i)- Whereas in case of the single m-day-ahead AVaR ° (Lt) is a true lower 

bound to AVaR, (Lt), their aggregation AVaR^ lL^+m+i) is rather a lower bound to the upper 
bound AVaR, (L,+m+i)- It can happen that T.ff-i AVaR f (L,+m,+i) is either an upper bound or 
a lower bound for EfcLi AVaR, (Lt+m+i)- Nevertheless, we will use AVaR“(Lf+m+i) as a weak 
lower bound in our numerical experiments to get an orientation about how much the upper bound 
E™^i AVaR“(L,+m+i) is tailing off. 


5 Extreme value theory based quantile estimation 

5.1 Generalized Pareto Distribution 


Up to now we have not fixed the noise distribution, only assumed certain properties like infinite 
right endpoint or continuity of the distribution function. Throughout we worked with a close to 1 
corresponding to the noise distribution function to be close to 1. Thus it is sufficient to specify the 
distribution function above some high threshold u. This is a typical assumption in extreme value 
theory, and we will apply the Peaks-over-Threshold method (as in McNeil and Frey (2000)). We first 
explain the setting in general. 
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Figure 1; Motorola stock price analysis: loss data (top), conditional variances after fitting a 
GARCH(1,1) model (middle), and residuals of loss data (bottom). 


The Generalized Pareto distribution (GPD) is given by 




l-exp(-^), ^ = 0, 


(5.1) 


where /3 > 0 and ^ e M. If ^ > 0 (5.1) is dehned for x > 0 and if ^ < 0 (5.1) is defined on x e [0,-/3/C]j 
see e.g. Section 3.4 in Embrechts et alT| (1997). Assume that we fix some high threshold u > 0. 
Given a random variable X with distribution function F and right endpoint xp, its associated excess 
distribution function is defined as 


F, 


{y) = ¥{^X-u<y \ X > uj = 


F{y + u) - F{u) 
l-F(n) ’ 


0 < y < xp - u. 


(5.2) 


The strength of the GPD is compressed in a result by Pickands (1975) and Balkema and de Haan 


(1974) which classifies the GPD as the limit distribution of a large class of excess distributions. More 
precisely, under mild conditions there exists a measurable non-negative parameter /3 = /3{u) such that 

lim sup |E„(x) - Gj «(„)! = 0 

U^Xp {)<x<Xf-U 


holds, see Theorem 3.4.13 in Embrechts et al. (1997) for a rigorous statement of this result. The 
density of (5.1) is given by 


9iA^) 


^iexp(-i), { = 0, 
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(5.3) 


































ACF of log returns 




ACF of squared log returns 



Lag 

ACF of squared residuals 



Figure 2: Motorola stock price analysis: sample autocorrelation functions for loss data (top) and 
residuals after fitting a GARCH(1,1) model (bottom). 


Under the assumption that Z has the distribution function which for some high enough threshold 
u > 0 satishes Fu{x) = G^^/ 3 (x) iov ^ < x <xf -u and for some ^ e M and /3 > 0, we find for a > F{u) 
(for ^ = 0 we interpret this quantile as the quantile of the corresponding exponential distribution) 


1 


— OL Ja i — ( 


•e 1-^ 


By (2.7) we obtain 


FzH»)-FzH\{a + l)f -l)) . 


e 

Unfortunately, there is no explicit expression for F^l{y)dy 


(5.4) 

(5.5) 


5.2 Statistical model fitting 


In this section we apply the theory and formulas derived previously to a data set. We choose the his¬ 
torical daily closing prices of the Motorola stock from 1st March 1985 until 15th October 2014 as this 
data set provides several canonical features of financial time series. We transform prices into negative 
log-returns; i.e., into losses, and fit the GARCH(1,1) parameters using Quasi Maximum Likelihood 
Estimation (QMLE) (e.g. Eranq and Zakoian (2010), Chapter 7). The parameter estimates can be 


found in Table and the outcome is depicted in Eigure[2 

We see in the middle plot of Eigure[^ major clustering of volatility in October 1987 (Black Monday), 
in a pronounced period between 2000 until 2002 (Dot-com bubble and wake of 9/11 attacks) and in 
a longer lasting period following the financial crisis between 2008 until 2010. 
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Mean excess plot 




Figure 3: Fit of the Generalized Pareto Distribution. Left: mean excess plot of the positive residuals, 
and QQ-plot of the threshold exceeding residuals against the fitted GPD. Right: excess distribution 
Fu(x-u) from the fitted GPD model (solid line) against the empirical estimates of excess probabilities 
(dotted points). 


Parameter 

Value 

Standard error 

Qq 

2e-07 

1.09e-07 

ai 

0.0451 

0.0014 

b 

0.9531 

0.0013 


Table 1: Estimated GARGH(1,1) parameters by QMLE. 


In a next step we examine the sample autocorrelation functions of the loss data and the residuals 
after fitting a GARCH(1,1) model. In Eigurej^the bottom plots depict the acf of the residuals and 
the squared residuals and is supportive for the our assumption of i.i.d. GARGH residuals Zt. This 
is also reflected in several runs of the Ljung-Box for various lags for the residuals. The residuals also 
pass the augmented Dickey-Euller and the KPSS stationarity tests. 

As explained in Section we fit a GPD to the upper tail of the residuals. We hrst have to choose 
a high enough threshold value u and we choose it as the approximate 92% quantile of the residuals. 
This is supported by studying the mean excess plot of the nonnegative residuals in Eigurej^ the 92% 
quantile of the residuals (solid blue line) yields a threshold which sufficiently marks the beginning of 
the linear behaviour of the mean excess plot. Since the empirical mean excesses are increasing, we 


Q. 

E 


CO 


QQ Plot of Sample Data versus Distribution 



Quantiles of generalized pareto Distribution 


Eigure 4: 


QQ-plot of the threshold exceeding residuals against the fitted GPD. 
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may assume that the shape parameter ^ is positive. This is confirmed by the parameter estimates 
for ^ and /3. The Maximum Likelihood Estimators are ^ = 0.3376 with a 95% confidence interval 
[0.2272,0.4481] and j3 = 0.4609 with a 95% conhdence interval [0.4023,0.5280]. 

In Figure]^ the right hand plot depicts the GPD fit of the excess distribution Fu{x -u) = P(X < x \ 
X > u) superimposed on empirical estimates of excess probabilities. Note how well the GPD model 
fits to the empirical estimates of the excess probabilities. 

A QQ-plot of the empirical quantiles against the fitted quantiles is depicted in Figure]^ Note again 
the good correspondence of the fitted GPD with the empirical estimates. 


5.3 Fitting time-consistent risk measnres to data 

We now compute the corresponding time-consistent risk measures from Sections and 


5.3.1 Time consistent VaR estimation 

In a first step, for a single loss position Lt we compute the m-day-ahead time-consistent VaR given 


by Proposition 3.5 for different levels of a; i.e., we fix t and consider VaRj (Lt+m) for various m e N. 


In Figure]^ we plot VaR^ (Lt+m) for m = 1,..., 10. 

Once the single time consistent risk measures VaR^(Lt+m) are computed, we simultaneously get the 
risk measure of the aggregated losses over m days from Proposition |3.6| by aggregation; i.e., 

m m 

VaR“(EAi,,) = ^VaR“(Li+,). 

i=i i=i 


^t+j) for m= 1,..., 10. Table 0 shows the values of VaR for single 
losses Lt+m and aggregated losses I^t+j for various levels ot a and m= 1,..., 10. 


In Figure [^we plot YaRt {ZjLi Lt+jj 


5.3.2 Time consistent AVaR estimation 


In a second step, we compute the approximate upper and lower AVaR bounds for single loss position 
Lt and we compute the m-day-ahead for different levels of a; i.e., we fix t and consider AVaR“(Lt+m) 
and AVaR^(Lj+m) for various m e N. The risk measure of the aggregated losses over m days we 


obtain from Propositions 4.7 



123456789 10 

days ahead 


Consistent VaR 



Consistent VaR 


Figure 5: Time-consistent VaR estimation for single loss positions m days ahead. 
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Figure 6: Time-consistent VaR estimation for aggregated loss positions m days ahead. 


single loss 


aggr. loss 


aim 


97.5% 


98% 


98.5% 99% 


97.5% 


98% 


98.5% 99% 


1 

2 

3 

4 

5 

6 

7 

8 

9 

10 


0.0064 

0.0068 

0.0072 

0.0076 

0.0080 

0.0084 

0.0089 

0.0094 

0.0099 

0.0105 


0.0069 

0.0074 

0.0079 

0.0085 

0.0091 

0.0097 

0.0103 

0.0110 

0.0118 

0.0126 


0.0077 

0.0083 

0.0091 

0.0099 

0.0108 

0.0117 

0.0127 

0.0138 

0.0150 

0.0162 


0.0088 

0.0099 

0.0111 

0.0125 

0.0140 

0.0156 

0.0175 

0.0196 

0.0218 

0.0244 


0.0064 

0.0132 

0.0204 

0.0280 

0.0360 

0.0444 

0.0533 

0.0627 

0.0726 

0.0831 


0.0069 

0.0143 

0.0222 

0.0307 

0.0398 

0.0495 

0.0598 

0.0708 

0.0826 

0.0952 


0.0077 

0.0160 

0.0251 

0.0350 

0.0458 

0.0575 

0.0702 

0.0840 

0.0990 

0.1152 


0.0088 

0.0187 

0.0297 

0.0422 

0.0562 

0.0718 

0.0893 

0.1089 

0.1307 

0.1551 


Table 2: Values for VaR for single losses Lt+m and aggregated losses ESi Lt 


ajm 

1 

2 

3 

4 

5 

6 

7 

8 

9 

10 


lower bound 


97.5% 

0.0098 

0.0115 

0.0134 

0.0157 

0.0183 

0.0214 

0.0250 

0.0291 

0.0340 

0.0398 


98% 

0.0106 

0.0127 

0.0152 

0.0182 

0.0217 

0.0260 

0.0311 

0.0371 

0.0444 

0.0531 


98.5% 

0.0118 

0.0145 

0.0180 

0.0222 

0.0275 

0.0340 

0.0420 

0.0520 

0.0643 

0.0795 


99% 

0.0135 

0.0177 

0.0231 

0.0302 

0.0394 

0.0515 

0.0672 

0.0878 

0.1146 

0.1497 


upper bound 


97.5% 

0.0098 

0.0131 

0.0173 

0.0230 

0.0304 

0.0402 

0.0532 

0.0704 

0.0932 

0.1232 


98% 

0.0106 

0.0146 

0.0199 

0.0271 

0.0370 

0.0504 

0.0687 

0.0936 

0.1276 

0.1738 


98.5% 

0.0118 

0.0168 

0.0239 

0.0340 

0.0483 

0.0686 

0.0974 

0.1384 

0.1966 

0.2792 


99% 

0.0135 

0.0206 

0.0313 

0.0475 

0.0721 

0.1093 

0.1658 

0.2515 

0.3813 

0.5783 


Table 3: Values for AVaR“ and AVaR for single losses Lt+m with m = 1,..., 10. 


5.4 Conclusions 

Obviously, the interpretation for the dynamic time-consistent (A)VaR differs considerably to that of 
the static (A)VaR: the dynamic (A)VaR evolves via the composition of the static (A)VaR over time. 
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ajm 

97.5% 

aggr. AVaR" 

98% 

98.5% 

99% 

97.5% 

aggr. AVaR 

98% 

98.5% 

99% 

1 

0.0098 

0.0106 

0.0118 

0.0135 

0.0098 

0.0106 

0.0118 

0.0135 

2 

0.0213 

0.0233 

0.0263 

0.0312 

0.0229 

0.0252 

0.0286 

0.0341 

3 

0.0347 

0.0385 

0.0443 

0.0543 

0.0402 

0.0451 

0.0525 

0.0654 

4 

0.0504 

0.0567 

0.0665 

0.0845 

0.0632 

0.0722 

0.0865 

0.1129 

5 

0.0687 

0.0784 

0.0940 

0.1239 

0.0936 

0.1092 

0.1348 

0.1850 

6 

0.0901 

0.1044 

0.1280 

0.1754 

0.1338 

0.1596 

0.2034 

0.2943 

7 

0.1151 

0.1355 

0.1700 

0.2426 

0.1870 

0.2283 

0.3008 

0.4601 

8 

0.1442 

0.1726 

0.2220 

0.3304 

0.2574 

0.3219 

0.4392 

0.7116 

9 

0.1782 

0.2170 

0.2863 

0.4450 

0.3506 

0.4495 

0.6358 

1.0929 

10 

0.2180 

0.2701 

0.3658 

0.5947 

0.4738 

0.6233 

0.9150 

1.6712 


Table 4: AVaR" and AVaR^ for aggregated losses ^t+j- 


This results in a much more conservative risk measurement as the risky positions that are due far 
in the future not only enter the risk assessment through their own dynamics at the future maturity 
but rather enter through their risk assessment along any time point up to maturity. This has the 
intended effect that risky effects which arise until maturity are cushioned at any time. As one would 
expect, the higher safety margins a are required the more dramatic is the increase of safety capital 
when more days-ahead risk management is envisioned. 

Table contrasts single and aggregated time-consistent VaR values for different a and maturities m. 
It shows convincingly, how much higher capital reserves are needed to guarantee uniform safety at 
the same level over the whole time to maturity. Already at a level of a = 0.975 the time-consistent 
aggregate loss VaR more than doubles from maturity 1 to 2 and multiplies by a factor of more than 
12 to maturity 10. There is a high price to pay to safeguard against all uncertainties, which may lie 
in the far future. 


For a comparison recall a standard industry method to estimate a 10-day VaR based on the central 
limit theorem, or normality of future losses (e.g. McNeil et al. (2010), Section 2.3.4). Recall that 


the loss from time t over the next m periods can be written as the sum over the negative returns 
during this period. If returns are iid with mean zero and variance (or even normally distributed), 
then this sum is again (approximately) normally distributed with mean zero and variance ma^. This 
motivates the estimation of the sum of losses over m days by the estimation of the 1-day VaR and 
multiply it by \pm. 

Let us compare the values for VaR^ from Table with this industry standard. We find for a = 
0.975 a 10-day VaR of 0.0064^10 = 0.0202 (which we have to compare with the time-consistent 
VaRf = 0.0830, which is more than 4 times as large), and for a = 0.99 a 10-day VaR of 

0.0088 VTl) = 0.0278 (which we have to compare with the time-consistent VaR”(E]2i Ti+j) = 0.1553, 
which is more than 5 times as large). One reason for this huge difference is the well-known fact that 
GARCH losses do not scale with ^m, but scaling depends strongly on the parameters; cf. Franq 


and Zakoian (2010|, Chapter 4. However, this alone does not explain the huge difference between 


the simple industry standard and the time-consistent VaR for the aggregated losses. 


Due to their construction the composed VaR and AVaR for aggregated future losses produce much 
more conservative reserve requirements than the standard VaR and AVaR for the same level of a. As 
an implication the standard reserving requirement of excessively high levels of a like 99% or 99.9% 
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covering 100- or 1000-year events may be put to a test taking into consideration reduced levels of a, 
e.g. in the bandwidth 90% - 97.5%. The reduction of such extremely high levels would also be very 
reasonable from a statistical point of view as lower level quantiles give rise to much more reliable 
estimators. 


A Proofs of Section 3 


Proof of Theorem 3.5 We proceed by backward induction. Firstly, by (3.2), at T - 1 we have the 
1-day-ahead-VaR 

VaR'p_]^(Z/j’) = ax 

which agrees with (3.7) for t = T -1. Assume that ( |3.7[ ) holds for all s = t,..., T - 1. We have 

= VaR“_i(VaR“(LT)) 

= yaRU{F^\a)^Jvl{a,F-Ua) + b)) 

= essinfjm e ^ I ^t-i) ^ «}• 

We denote by the conditional probability with respect to Ft-i- Note that 

-i{Fz\ot)^Vj{aiF-l{a) + b) < m) = Pt-iiFz^afvJ (aiF^l(a) + b) < m^) 
\al,{aiF-^l{a) + - ao + bf \ 


t-i 


Using the definition of the GARCH volatility ( |2.5[ ) for this can be continued by 
¥t-i{Fz\a)^Vf{aiF-l{a) + b) < m) 




1 


{aiF^l{a) + b) 


T-t-l 


m 


FzKa) 


-ao E {aiFz\{a) + h)A-ao-ba‘{\ 
k=o ' 


Since at is .Fi_i-measurable and Zt is independent of we conclude that 
V^t,(LT) = Fz\a) 


’N 


T-t-2 

{ao + {aiF-l{a)+b)a‘^){aiF-l{a) + b)^ * ^+ao ^ (aiF"J(a)-r 6)^ 

fc=o 


T-t-l 


= f; 


(^(q:)\ ao X! {aiF^l{a) + b)^ + a'^{aiF^l{a) + b)'^ ^ 
\ fc =0 




This finishes the proof. 


B Proofs of Section 4 

We need the following lemma. 
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Lemma B.l. For t - 0,... ,T - 2 assume that ft-M-^Misa Ft-measurable, and strictly increasing 
mapping. Then we have 

{uj€n: > YaR'^iftiZl,))} = {uj€n: Zl, > F^l(a)}. 


In particular, 

{a; e O : at+2 > VaR“((Ti+ 2 )} = {w e : Z'j^i > F^l{a)}. 

Proof. Due to the assumptions on ft it is invertible. According to the definition of VaRf we have 

YaR<f{ft{Zl^)) = essinf{m e L^{Ft) ■■ <m\Ft)>a] 

= essinf{m e L^{Ft) '■ P{Z^^i < ff^{m)) > a} 

= essinf{m e LP{Ft) • ^ ck} 

= essinf{m e : fn> ft[F^l{a))^ 

= /ife'(a)) 


where the third line follows from the independence between Zt+i and Ff Thus 

ftiZl,) > YaRtiftiZlt)) = MF-^Uc^)) 

holds if and only if Z^^^ > F^l{a). For the second part, note that by ( |3.4[ ), 

VaR" ( 0 - 4 + 2 ) = \/ao + a‘f_^^{aiF-l{a) + b). 

From the definition of the GARCH (1,1) model (2.5) we conclude 

U 4+2 = ^ao + al^{aiZl^ + b) = ft{Zl^) > YaRt{ft{Zlt)) = \/ao + al^{aiF-l{a) + b) = VaR"(cTt+ 2 ) 


if and only if Zf^^ > F^\{a). 


□ 


Proof of Theorem 4-5 We apply again backward induction. From (4.1) and Example 4.3 we have 


AVaR. 


-i{Lt) = (^tt^ [ Fzl{u)du, 

i. — OL Jol 


which agrees with (4.4) for t = T -1. For simplicity we write K 2 = F^\{u)du. Now assume 

that (4.4) holds for all s = f,..., T - 1. Then it remains to prove (4.4) for f - 1. We have by (4.1) and 


AVaR"_i(L^) = AVaRti(AVaR“(L^)) = AVaR*" 1 (7t2 Vj (01^2 + ^)) • 

We denote Gt ■= Vf {aiF2 + b), which is a measurable function of 0-4+1, and take the constant out of 
the expectation, which yields 

A^“ i( 4 ) = K2 E4„i [Gt I Gt > VaR" i(G4)]. 


Now note that by Definition 3.1 


VaRti(G 4 ) = essinf{m e L^{Ft-i) ■ P(G 4 < m \ Ft-i) > a}. 
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We denote by Pt_i the conditional probability with respect to Tt-\- We compute further, using the 
definition of the GARCH volatility (2.5) for 


T-t-2 

E 

/c-O 


1 —t—Z 

<m) = Pf-i(ao ^ {aiK2 + h)^ + a^^^[aiK2 + b)^ ^ ^ < rri^ 

m-ao EL"o~^(aW2 + b)^ ' 




{aiK2 + b)"^ *■ 

2 /m-ao EL"o"^(«i«2 + bf 


F^) 


(aiK2 + b) 


T-t-l 


- ao - but)-2 )> 

/aiaf ) 


where in the last line we have used that at is Ft-i-measurable and the independence of Zt and J-t-i. 
We can thus conclude that 


T-t-2 


VaR"_i(Gt) = (ao + (aiF^l(a) + ^))(aW2 + 6)^ ‘ ^ + oq XI + b)' 

^ k=0 


From Lemma 


B.l 


we know that {Gt > VaRf(Gt)} = {Zf > F^j{a)}. Hence, it follows from the 
independence of Zt and Ft-i that 

E*_i [Gt I Gt > YaRtiiGt)] = Et-i[Gt \ Z^ > ^ (B-l) 




Moreover, we calculate 

Ei_i[Ga{z2>F^i(„)}] 

/c T~t—1 r 2 

= (l-a)ao y] (aiK2 + 6 ) +(aiK2 + &) / (ao + af{aiu + b))dFz2{u) 

fc=o 

/c T~t—i 

= (l-a)ao X! (^1^2 + ^) +(aiK2 + &) / [ao + a^{aiF^l{u) + b)^du 

R_n O' 


k=0 

T-t-l 


= {l-a)ao X! {aiK2 + by+ [aiK2 + b)'^ * aiK 2 + (1 - a) 6 )(T, 
fc=o 


which in combination with (B.l) yields 


T-t-l 


Et_i[Gf I Gi>VaR“i(Gi)] = ao ^ {ai^2 + b) +{atK2 + b) ^ a^ ^ Vl^{aiK2 + b). 

k=0 


This finally amounts to 


AVaR, 


-i( 4 ) = 7 ^ f^Fz\u)du Et.i[Gt I Gt > VaR“i(Gi)] 

1 - a Ja 

=-E— f Fz^{u)du'P^_i{aiK2 + b), 

1 - a Ja 


which proves the assertion. 


Proof of Proposition \4. 7| A careful proof tracking reveals its similarity to the proof of Theorem 4.5 
For simplicity we set 7c = yj Fz^{y)dy and 7c2 = /J F‘J(y)d?/. 

At t = T - 1 we have AVaR'p_]^(L'r) = PaT which coincides with AVaRy_]^(L 7 ’). Since by Definition 
and (4.8) and (4.9), 

AVaRE2(^r) = AVaRy_2(A^Ei(^T)) = 7 c AVaRr_2(f^T), 
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we obtain 


AVaR^_2('^T) = IEt-2 \ or > VaR^_2(*^T) 

= Er_2 (Tt I ■^T-l > 


by Lemma B.l An application of Jensen’s inequality yields 


Ej ’_2 (Tt I -^T-i > - ^E 2 ’_ 2 j^cr|' I > F^l{a) ^ 


1/2 


We obtain further 


ET- 2 j^< 7 T I ^T-l ^(*^) ~^T-2 Uq *^T -1 (®1 ^T -1 I •^T -1 ^(®) 

= 1^ (ao + 4-i(«i2/ + ^))dFz2(y) 

Z2\ / 

= ao + aT-i(b + ai-^ f ydFz2{y)] 

= ao + cry_i(6 + 01 ^ 2 ), 


which amounts to 


AVaR“ 


T-2 


(Ft) < K,^aQ + cj|,_^^aiK2 + = AVaR^_2(LT). 


This proves for t = T - 2 that AVaR 7 ’_ 2 (Lr) is an upper bound for PNaRrp_ 2 {LT)- 


Now assume that AVaR^ {Ft) ^ AVaR^ {Ft) holds true for s = T - 1,..., t + 1. We show next that 
also 

A^“(Lr) > A^"(Lr). 

To this end notice that 

A^“(Lt) = AVaR"(A^“+i(LT)) < AVaR“(A^“+i(LT)). 

Moreover, we have by the induction assumption 


(B.2) 


AVaR^(AVaR“+i(LT)) = AVaR^(7c2 y ^i+i( F-l{u)du + ft)) 

= K2Et [FFi aiK2 + b] I sJplFFFb) > VaR,”(\/^ 01^2 + 6)) 

By a similar calculation as in the proof of Theorem 4.5 and Lemma |B.l we can see that the above 
expression simplihes to 


AVaR“(AVaR“ i(Lt)) = 7c2 (ai«;2 + 6) I > F^\{q.) 

1 

~ Ja (lEi['Pt+i(aiK2 + b) 

where the last line follows from Jensen’s inequality. Note that 




1 i\R2 
2(«)]) , 


Et 


'P^+i[aiK2 + b) Z^^i>F^l{a) 
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T-t-3 


^ (oiK2 + ^)^ + (ao + <7i\i(oiy + 5))(aifi;2 + 6)^ * ^)dF22(y) 

= oo XI {avi 2 + b)'‘+—^{aiK 2 + b)'^ ^ ‘^(^t+i f {aiF^Hv) + b)dy 
fc=o 1 

= ao X («iK 2+ &)^ +0-^+1 (aiK2 + ^ 

fc=o 

= n^(ai+6), 


which implies 


AVaR^ 


(AVaR“ i(Lr)) < ^ f'Fz\y)dyJv^{aiK2 + b) = AYaR"(L t). 

i — O' Ja ’ 


Finally it follows from (B.2) that AVaR^ {Ft) ^ AVaR^ {Ft)- 


Proof of Proposition f.8 At t = T - 1, AVaR ^ i(Lt) coincides with AVaRji_]^(L 2 ’). At t = T - 2 we 
obtain 


AVaR:r_ 2 (RT) = AVaR! 


T-2 


(AVaR'p_^(L7’)) - -- f F"^^(?/)dy AVaR^_2(cr7’). 

i — O Ja 


For simplicity we write k = F^^{y)dy. By continuity of the distribution function inherited 

from Z, 


AVaR^_2 (^^t) = E'p_2j^cr7’ | ctt > VaR^_2('^T) 


which by Lemma B.l rewrites as 

AVaR^_2(i7T) = IEt-2 | ^t-i ^ F^{o) 
Using the ^r- 2 -measurability of tt-i and ao > 0, we calculate further, 

IEr-2 


CJj' 


Zt-1 > Fz^{a) - ^T-2 \/®0 + crXl(®l'^T-l I F/t-I > F^2{o) 

> tt-i E'r_2[^'y^ aiZ‘^_^ + b \ Z^_i > F^2{o) 

= aT-i-^ f \/(HyTbdFz2{y). 

1-0 JF-lia) 


(B.3) 


Hence, it follows that 


AVaR“ 


-T-2 


{Ft )-^ ^ -j: yJaiF^l{y) + bdyuT-i = AVaR g._ 2 (LT)- 


This proves for t = T - 2 that AVaR y_o(Lr) as in (4.11) is a lower bound for AVaR 2 ’_ 2 (-^T)- 

Now assume that AVaR^ fLr) < AVaR^ {Ft) holds true for s = T - 1,..., t + 1. We show next that 

also 


AVaR” (L t) < AVaRf {Lt). 


To this end notice that 


AVaRt {Lt) = AVaR^(AVaRt+i(Lr)) > AVaR?( AVaR^ , 


(B.4) 
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Moreover, we have by the induction assumption 


AVaR“(MaR“ i(Lt)) = AVaR“(7€CTT-i £ ^aiF-l{y) + bdy) 


= fp-i Vf^iy + b^Pz2{y)) ^t[crt+2 I cru2 > VaR“(cri+2) 


Z2(") 
(a) 


(£- /I 1, , \/aiy + bdFz2{y)) Et\at+2 \ ^t+i > F^lia)) 


where the last equality follows from Lemma B.l Then by the same calculation which lead to (B.3), 

Et au 2 I > F^lia)) = aui yjaiF^l{y) + bdy, 

this yields together with (B.4), 


'(Lr)>K^Y^y' yJaiF^l{y) + bdy^ ut+i = AVaR"(Lr) 


AVaR, 
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